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NOTE TO READERS 


In December 1974 Scientific Time Sharing 
Corporation held a seminar for its 
APL*PLUSTt Service Representatives. One 
outcome of that seminar was a “wish list" 
of goodies these marketing folks would 
like to have. On that list, they 
expressed a desire for some sort of 

"APL Whizbang" -- a column describing 
neat programming tricks that would illus- 
trate some powerful, but perhaps not 
obvious, features of the language. 


A year later, in delayed response to that 
request, the "Whizbang" was born in News, 
the APL*PLUS Service newsletter. The 
first "Whizbang" illustrated a program- 
ming technique for finding the current 
definition of a name. That was the first 
and last pure "Whizbang". In response to 
jreader requests, the "Whizbang" evolved 
into a more general tutorial on APL pro- 
gramming techniques. 


Over the past two and a half years, the 
reader response to the "Whizbang" has been 
gratifying. Today it remains one of the 
most popular features of News. For those 
who've never seen the "Whizbang" and for 
those who'd like to reminisce, we've 
collected past "Whizbangs" in this handy 
reference booklet. 


Neither the "Whizbang" nor this booklet 
could exist without the devoted efforts 

of Karen Kromas and Beth Matsumune of 

STSC. Many thanks to them -~- for patiently 
translating my barely legible scrawl into 
finished columns -- and to the other STSC 
people who review the drafts. And, most 

of all, thanks to the readers who make it 
all worthwhile. 


Roy A. Sykes, Jr. 
Los Angeles, California 
August 1978 


JAPL*PLUS is a service mark and trademark 
of Scientific Time Sharing Corporation, 
registered in the United States Patent 
and Trademark Offices. 


Copyright © 1978 Scientific Time Sharing 
Corporation. 


Printed in the United States of America. 
All rights reserved, including the right 


to reproduce this document or any portion 
thereof in any form. 
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WHAT'S IN A NAME? 


Finding the current definition of a 
named object in the workspace is useful in 
programs that may or may not reference 
global objects. A common technique, using 
the APL*PLUS primitive [JSIZE, to find the 
storage required is: 


R«O0<QSIZE 'OBJECTNAMES' 
a [JSIZE «> STORAGE REQUIRED 


Unfortunately, this algorithm returns a 1 
indiscriminately for groups and labels, as 
well as for functions and variables. Us- 
ing OIDLOC (object definition) is more 
appropriate, but has the disadvantage of 
producing too much information; we wish 
only to know what the current definition 
is, not that at every level of the state 
indicator. 


Before the scan operator's existence, 
the technique for picking out the proper 
element of A+«Q)IDLOC 'OBJECTNAMES' (in 
ORIGIN 1) was cumbersome: 


R<((-4+(° 1264 )11)6A)0 515 


With the advent of scan, the same funda- 
mental technique was used, incrementally 
improved to: 


R<«((+/A\A= 1)04)031) 


But using scan in another way gives 
us a much more elegant solution: 


R+(,<\A# 1)/,A 


The last solution is fast and succinct, 
and works in either origin. 


The results of these three algorithms 
are the same: 


object is currently 


undefined 
a function 
a variable 
a group 
a label 


ORNE Ol 


Observe that <\ is similar to v\, but 
only the first 1 on each row is preserved. 
Because the global definition of an object 


cannot be 1, we're assured of exactly a 
single 1 in each row of the result of <\. 


One use of this technique is the emu- 
lation of subscripted or triadic APL func- 
tions (A/B or A/[KIB): 


V R<A COMPRESS B;C 


[1] C«,QIDLOC 'COORD' > +(2=(<\C# 1)/C)pA 
[2] R<«A/B O 70 
[3] A:R+«A/[COORD] B O C+[JERASE 'COORD' 

Vv 


V R«B SUB C 
C1] COORD<C © RB 
V 


1 0 1 COMPRESS 3 3019 SUB 2 


ER 
mW 


1 01 COMPRESS 3 3019 SUB 1 


De 525-33 
785 9 


1 0 1 COMPRESS 3 3019 


FR, 
mW 


December 1975 


PERMUTATION VECTORS IN APL —- A RANK ANSWER 


This article shows how one can use 
permutation vectors in APL to rearrange 
city names and population data from sepa- 
rate sources into a convenient report. 


One common characteristic of the four 
expressions: 


S?S O18 0 kV Oo W 


is that they each generate a pegmutation 
vector. A permutation vector of length W 
1S an arbitrary arrangement of iv. A 
permutation vector may be recognized by 
the truth of either of the following ex- 
pressions, which both assert whether PV 
contains exactly one of all its indices: 


A/(ipPV)ePV 
A/PVLAPV]=19PV 


One may also test whether a vector is 
a permutation vector by: 


A/PV=AAPV 


This statement depends on the fact that 
every permutation vector has a unique 
inverse permutation, expressed as APV. If 
PV is indeed a permutation vector, then 
the inverse of its inverse should be iden- 
tical to itself. 


Because permutation vectors represent 
all indices of a vector with pPV elements, 
they are commonly used in selection and 
reordering. For instance, GV+¥V generates 
that permutation which, if applied as 
subscripts to V, would arrange V in de- 
scending order: 


V+?10p99 © V2<V[GV<+¥V] © A/V2=L\V2 


1 
V,C1] GV,[0.5] V2 

85 41 15 40 5 42 80 2 54 78 (V) 

1 710 9 6 2 4 3 5 8 (GV) 

85 80 78 54 42 41 4015 5 2 (V2) 


The particular permutation GV is 
called a descending grade vector. The 
inverse permutation of a grade vector is 
called a ranking vector, which expresses 


the ordinality of the vector: 


V,C1] GV,[0.5] AGy 


85 41 15 40 5 42 80 2 54 78 (V) 
1 710 9 6 2 4 3 +5 8 (GV) 
1 6 8 7 9 5 210 4 3 (AGV) 


AYV is the descending ranking vector 
on V because it Is the permutational in- 
verse of the descending grade vector of /V. 
One might expect then that AAV is the 
ascending ranking vector on V, and that is 
indeed the case. 


Although grade vectors occur more 
commonly, ranking vectors have many uses 
in merging, rearranging, and reporting 
data. For example, suppose we have a 
matrix of city names, NAMES, arranged by 
state, an associated vector EW indicating 


whether the cities are eastern (0) or 
western (1), and two vectors, EAST and 
WEST, containing the population of the 
eastern and western cities. 


NAMES 
LOS ANGELES, CA 
SAN FRANCISCO, CA 
DENVER, CO 
WASHINGTON, DC 
CHICAGO, IL 
BOSTON, MA 
NEW YORK, NY 
PHILADELPHIA, PA 
HOUSTON, TX 

EW 
111000001 


EAST 

757 3367 641 7868 1949 
WEST 

2816 716 515 1233 


We wish to produce a single listing, 
sorted by both city name and population. 
This is almost impossible to do (unless, 
fortuitously, the order of cities and 
populations correspond), because objects 
(lines of the report) can only be arranged 
in one order at a time. 


However, ordering can also le indi- 
cated by a ranking vector, so we decide to 
arrange the report by city name, and ex- 
press the population relationships by a 
ranking vector. The function REPORT uses 
two ranking vectors and one independent 
grade vector. 


VY REPORT;RA;ALP;POP;RANK;GRADE; FORMAT 
C1] a REQUIRES <NAMES,EW,EAST ,WEST>. 
2] ALP+' ,ABCDEFGHIJKLMNOPQRSTUVWXYZ' 
3] RA+pALP © POP+( EAST WEST )CLAAEW] 
C4] RANK+AVYPOP*+POPLGRADE+ARALALP.QNAMES J 
[5] ‘CITY NAME ___ -POP RANK" 
[6] FORMAT+'17A1,X3,24,X2,I3' 
C7] FORMAT AFMT(NAMES(GRADE;1;POP;RANK) 


On line (3] of REPORT we use an as- 
cending ranking vector, AAEW, to merge the 
elements of EAST and WEST: 


EW O AEW O AAEW 
0 0 
7 2 
1 4 


EAST WEST © Q+POP+(EAST WEST )CAAEW] 
757 3367 641 7868 1949 2816 716 515 1233 
2816 716 515 757 3367 641 7868 1949 1233 


POP and NAMES now correspond; hence, we 
must reorder both by the same indices to 
maintain that relationship. 


Line [4] of REPORT performs several 
operations. It generates an ascending 
grade vector by coding the city names into 
numbers and grading up those numbers; it 
reorders POP into ascending sequence by 
city name, saving GRADE so we can later 
reorder NAMES by the same permutation; and 
finally, it generates a descending ranking 
vector on POP to express the ordering of 
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populations, although we will be using the 
ordering of city names to organize the re- 
port. 


O«CODED*RALALPiQNAMES 
2.09224 3.02224 8.95£23 3.59224 7.67#23 
6.62223 2.32F24 2.62824 1.52F24 


O+GRADE+ACODED 
65391782 4 


O+POP+POP(GRADE] 
641 3367 515 1233 2816 7868 1949 716 757 


(¥YPOP),[0.5] RANK+AYPOP 
62574981 3 (YPOP) 
82953147 6 (RANK) 

Line [7] uses GRADE, which we compu- 
ted on line [4] to sort by city name, to 
reorder NAMES in ascending sequence; it 
then uses AFMT to format the report. 


VOILA! ~-- Your very own almanac. 

REPORT 
CITY NAME ~POP RANK 
BOSTON, MA 641 8 
CHICAGO, IL 3367 2 
DENVER, CO 515 9 
HOUSTON, TX 1233 5 
LOS ANGELES, CA 2816 3 
NEW YORK, NY 7868 1 
PHILADELPHIA, PA 1949 4 
SAN FRANCISCO, CA 716 7 
WASHINGTON, DC TST 6 
April 1976 
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SUBSCRIPTION 


This article discusses the use of 
subscription for selecting and changing 
scattered elements of an APL array. 


The APL indexing (or subscription) 
function provides a powerful facility for 
identifying elements of an array. Once 
Identified, those elements may be selected 
(DATA+V(IJ]) or changed (VE[IJ]*DATA). An 
individual element is identified by one or 
more indices, depending on the rank of the 
left argument. Each set of indices suffi- 
cient to identify one element of an array 
is called an index-tuple. The right argu- 
ment is a list of APL expressions repre- 
senting the indices for each coordinate of 
the left argument. The expressions are 
separated by semicolons, and the entire 
list is enclosed in brackets. 


LEFTARG(LR3s13sG3H3T30003A3P3sG] 


If all indices of a coordinate are used, 
the expression (but not the semicolon) may 
be elided for that coordinate. 


The elements that are identified 
represent the cross product (outer prod- 
uct) of all specified indices of each 
coordinate coupled to all specified indi- 
ces of the other coordinates. For in- 
stance, each element of a three-dimension- 
al (3-D) array, 


Q+T+ 2 3 5 pi30 
1 2 3 4 5 


11°12 13 14 15 


16 17 18 19 20 
21 22 23 24 25 
26 27 28 29 30 


must be identified by three indices: 
layer, row, and column. 


[e2 © Rei O C+u 
TCL;R;C] 
19 


If the result of any of the expres- 
sions is not a one-element array, each 
element is conceptually coupled with all 
elements of the other expressions to pro- 
duce an array of index-tuples (in this 
case, each having three elements), 


Ie 210 R#3 0 C+ 125 2 

F+'GM® 7£9;9;9] M! 

F AFMT(Lx100)°.+(Rx10)°.+C 
7023331] %02333;2)] %£0233;5] 7£23;3;2] 
TC13331] £01333;2] 101;3;5] 7£1;3;2] 


which are then used to select the data: 


TCL;R;C] 
26 27 30 27 
11 $12 15 12 


If we wish to select elements scat- 
tered throughout the array, the outer 
product index-coupling generally identi- 
fies more elements than we want. Essen- 
tially, we would like to disable the 


=6= 


(outer product) cross-coupling, leaving 
only (scalar) parallel-coupling. The two 
techniques for doing this are: 


° refine the result of the index func- 
tion, and 


° reduce the rank of the indexed array. 


It is important to note that, for 
vectors, only one index is needed to iden- 
tify each element. Hence, vector indexing 
already can do scattered-point selection 
because no index-coupling is performed. 


V+' BRUTE't O V[6 5 15 4] 
ET TU 


However, for matrices, two indices are 
needed to identify each element. Indexing 
Produces these two-element index-tuples by 
coupling all specified row and column 
indices: 


M<«'SPADE CLUB HEART DIAMOND* 
O+M<« 4 7 pM 

SPADE 

CLUB 

HEART 

DIAMOND 


M[1 2431; 1325 1] 
SAPES 
CUL C 
DAIOD 
HAETH 
SAPES 


Suppose that what we really want in 
the above case is simply a pairwise cou- 
pling of the row and column indices, for 
example: 


MC131],M0233],M04%;2],M03;5],ML1;1] 
SUITS 


Look back at the major diagonal (top 
left to bottom right) of the matrix re- 
sult. Primitive indexing has in fact 
selected what we want, and more. By using 
dyadic transpose, we can select that diag- 
onal, thus achieving the effect of a pair- 
wise coupling of the row and column indi- 
ces: 


11 0M[1 2431; 13 251] 
SUITS 


This technique applies to selecting 
scattered elements out of any array, as- 
suming the index expressions are vectors. 
For A[V1;V2;V3;...;VN] (where N«ppA), one 
would use 


(We1)QALV13V2;V2;...3VN] 
and the result would be a vector of length 


(pV1i)L(pV2)L (pV3)L...LpVN 


If we generalize the problem somewhat 
we find that we can primitively select: 


scattered points (scalar elements) from 
vectors by VIC], and 


scattered lines (row/column vectors) from 
matrices by M[R;] or M[;C], and 


scattered planes (matrices) from 
3-D arrays by TUL33;], T(;:83), or TU3;C]. 


(Elided coordinate indices may be replaced 
by any APL expression.) 


Remember that we can select scattered 
points from a matrix, but only if they all 
fall on a single row or column. The same 
principle applies to scattered points or 
lines in a 3-D array. Essentially 


MC3; 5123 52 4) 
THEATER 


is really only selecting scattered points 
from the vector 


M[33;] 
HEART 

(M[03;])(5 123 5 2 4] 
THEATER 


Conversely, we cannot select: 


scattered scalars from 
matrices or higher-order arrays, or 


scattered vectors from 
3-D or higher-order arrays, or 


scattered matrices from 
4-D or higher-order arrays, 


unless all the selected subarrays are 
Identified by single indices along all 
other coordinates (thus effectively reduc- 
ing the rank of the indexed array). 


In general, if we wish to select more 
than one subarray (e.g., a scalar) from an 
array of rank greater than one higher than 
that subarray (e.g., a matrix), we must 
use a Slicing transpose to obtain the 
result. The following table illustrates 
scattered-array indexing for common cases 
where L, R, and C are vectors: 


Select 

scat- from from from 
tered a vector a matrix 

points 1Q9V([C]* 1 1QM[R;C] 1 1 1Q7T[L3R;C] 
cols 1QVIC]* 1 2QM[;C]* 2 1 2QT[L;;C] 
rows 1QOVC]* 1 2QMCR;]* 1 1 2QT(L;3R3] 

planes 1 2QMC;]* 1 2 3QTCL;;3]* 


(*) The &® may be elided from these expres- 
sions. 


Here's an example of indexing scat- 
tered rows: 


a 3-D array 


J«'JELL JELLO JELLY JELLIED' 
G<'GEL GELD GELT GELATIN! 
O+JG+(4 7 pJ),[0.5] 4& 7 pG 

JELL 

JELLO 

JELLY 

JELLIED 


GEL 
GELD 
GELT 
GELATIN 


112 0/Gf111223; 14 34 1 5] 
JELL 
JELLIED 
JELLY 
GELATIN 
GEL 


The transpose technique is concise, 
but it has several disadvantages: 


° For large right arguments (the indi- 
ces) it requires a great deal of tem- 
porary workspace storage and extra 
computer time to select all the super- 
fluous data before the transpose is 
performed. 


° Altering the shape of the result ne- 
cessitates potentially confusing 
changes to both the indices and the 
left argument to transpose. 


° It does not allow specification. 


An alternative technique is to reduce 
the rank of the indexed array such that we 
can use primitive indexing. This requires 
that we somehow recalculate the indices to 
correctly identify the desired data. In 
the preceding example (selecting scattered 
rows): 


(8 7 pJG)[1 4 3 8 5 3] 
JELL 
JELLIED 
JELLY 
GELATIN 
GEL 


The indices were actually calculated by: 


IND*+ 11122 ,f0.5] 14341 
O«IND<(11)+( 1497G)LIND-11 
14385 


Notice the dependence on the ORIGIN (11) 
to offset the indices when working in 
ORIGIN 1. Number systems (which is actu- 
ally what 1 deals with) are based in 
ORIGIN 0. 


A more common case is selecting scat- 
tered points from an arbitrary array. 
Since only vector indexing primitively 
allows selection of scattered points, we 
must reduce the rank of the array to l 
(e.g., ravel it) and calculate the sub- 
scripts (assume ORIGIN 1): 


Ie 1111222212222 2 2 
Re tit1244 42242321 
C+ 123476755 74 71 2 3 
A JEL OS INTO A GEL 


(,JG)(1+(psG)1(L,01] F,[0.5] C)-1) 
JELL INTO A GEL 


It may be clearer if we look at rav- 
elled JG (after removing spaces), its 
indices, and the indices we calculated: 


Be! '2eV¥<,JG © Q'T2' APMT B/ipB © B/V 

£111111112222222233333344445555555 
123489012567892345678901678934560123456 
JELLIJELLOJELLYJELLIEDGELGELDGELTGELATIN 


123 4 42 55 56 54 12 42 53 42 43 37 31 


Notice how the base value operation 
(1) had the effect of multiplying LZ by 28 
(number of elements in each layer), R by 7 
(number of elements in each row), and C 
by l. 

In fact, we can write a generalized 
scattered-point selection function as 
follows: 


VY RA SPS B;C 


[1] a SCATTERED-POINT SELECTION. 

[2] a <A> IS THE ARRAY TO BE SUBSCRIPTED. 
[3] a <B> IS THE ARRAY OF COORDINATE 

[4] a INDICES IN <A> SUCH THAT ITS LAST 
[5] a COLUMN REPRESENTS COLUMN INDICES, 
[6] a THE NEXT-TO-LAST (IF ANY) IS ROW 
[7] a INDICES, THE NEXT IS LAYERS, ETC. 
[8]  CONFORMABILITY: (~1+1,pB)e1,pp0A 

[9] a ORIGIN DEPENDENT. (pR)= 1+pB 


[10] RepA > CH''11 
[11] R+(,4)0(R1(1019pB)9B)+C-R1iC] 
V 


A scattered-point assignment func- 
tion, utilizing a global to hold the data, 
might be defined as: 


V R+A SPA B;C 
[1] a SCATTERED-POINT ASSIGNMENT. 
[2] a SIMILAR TO <SPS> BUT THE RESULT IS 
£3] A THE LEFT ARGUMENT <A> MODIFIED BY 
C4] a THE GLOBAL <DATA>. 
[5] R+,A4 © C¥«''11 © A+pA 
[6] R((AL(1019pB)QB)+C~ALCI*<DATA © R+ApR 
V 


In both of these functions, the ORIGIN 
offset is calculated in a slightly differ- 
ent fashion to maximize efficiency for 
large index arrays. 


A few examples should show the power 
of SPS: 


+M<« 5 7 p1r35 
1 2 3 4 5 6 7 
8 9 10 1112 13 14 
15 16 17 18 19 20 21 
22 23 24 25 26 27 28 
29 30 31 32 33 34 35 


MSPS 52pii4, 14, 22, 3 2, 4 4 
14 9 16 25 
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M SPS 2,[1.5]15 A ROW 2; COLS 1-5 


8 9 10 11 12 


M SPS(15),[1.-5] 2 a ROWS 1-5; COL 2 


29 16 23 30 


M SPS § 1 915 a LAST COORD. EXTENDED 


19 17 25 33 


ROWS<«1,[1.5]15 © COLS<(15),[1.5]7-15 


0 2 0 *FROWS,COLS 


5 
1 
2 
3 
4 
5 


PRE PR 
OFEFwWNE 
NWOFUD 


0 ¥M SPS ROWS,(2.5] COLS 


OFwWoONrR~ 
ra 
ND 


The function SPA might be uSed as 
follows: 


DATA+ 2 
O«+M«M SPA 5 2p 11,1 4,2 2,3 2,4 4 


“2 2 372 5 6 7 
8 2 10 11 12 13 14 
15 ~2 17 18 19 20 21 
22 23 24 ~2 26 27 28 
29 30 31 32 33 34 35 


DATA+ ~3 °3 4 75 

O+M<«M SPA 4 2021, 45, 32, 54 
2 3°2 5 6 7 
210 11 12 13 14 

15 4 17 18 19 20 21 
3 
0 


22 23 24 2 °3 27 28 
29 30 31 5 33 34 35 
DATA+ 4 4 p14 
Re 1222333344 445555 
C+ 6357146713572567 
rs O+M+M SPA (4 4 pR),£2.5] 4 4 pC 
2.2 3°2 5 1 7 
32 211 313 4 
17417 219 3 4 
123 2 °2 327 +4 
29 131 °5 2 3 4 
June 1976 
-ll- 


SOME NOTES ON DATE STORAGE 


The processing of calendar dates 
often involves fairly complex conversions 
between the stored representation and what 
is presented to the user. Typically, 
dates are stored in one of two formats, 
both designed to allow the representation 
of a date as a single number: 


e Ordinal Representation 
Dates in this form are stored as inte- 


gers indicating the number of days 
elapsed since a certain base date. In 
particular, the APL*PLUS File Subsys- 
tem stores OFRDCI timestamps as ordi- 
nal values relative to March 1, 1960. 
Given a file timestamp 7S, the number 
of elapsed days can be calculated by 
LT7S+5184000 (5184000 = the number of 
sixtieths of a second in one day). 


° Packed Representation 
Dates in this form are stored as 
100LYEAR,MONTH,DAY. The form is com- 
monly abbreviated to YYMMDD, express-— 
ing only dates in the twentieth cen- 
tury. Today's date represented as a 
packed number can be calculated by 

1000000/10013t07S 

760714 a BASTILLE DAY 


Both representations allow qualita- 
tive analysis among dates. That is, dates 
stored in either format may be compared 
and ordered directly. However, quantita- 
tive analysis (e.g., "How many days be- 
tween ...") is possible only with the 
ordinal representation. 


Ordinal dates are not as commonly 
used as packed dates because they are 
significantly more expensive to process, 
particularly in the complex conversion 
necessary to display them in a meaningful 
format. Frequently, dates are stored in 
packed format, and occasionally converted 
to ordinal format for quantitative pro- 
cessing. 


Dates also have a variety of display 
formats, Such as: 
7/4/76 SEPTEMBER 2, 1945 31 MAY 69 
The standard format used in columnar re- 
ports is MM/DD/YY (although in Europe the 
accepted form is DD/MM/YY). Assuming our 
dates are in a vector DATES in YYMMDD 


form, how can we most efficiently display 
the standard format? 


The most obvious approach is to un- 
pack the dates into three numbers each, 
reorder them, and format each column: 


D<«¥19 100 100 100 TDATES 
'T2,0/0,272,0/0,222' OFMT D 


We can reduce the number of format 
phrases, and speed up OFMT's operation, by 
the following: 


'T2,2PQ0/0213' OFMT D 


~l]2- 


To really speed up the process, how- 
ever, we should consider formatting only 
one number per date. This can be done by 
repacking the dates in the desired order, 
and using the G format phrase to insert 
the slash (/) decorations: 


D<10011e 100 100 100 TDATES 
'GM™Z9/99/99M' [FMT D 


The reordering can be speeded up even 
more by: 


D<«10016 0 10000TDATES 


which reduces the required number of in- 
ternal arithmetic and data-movement opera- 
tions. 


A disadvantage of the above tech- 
niques is that they all require a large 
amount of intermediate storage; in fact, 
they may temporarily require up to six 
times the working area for each date! The 
solution is some clever algebraic manipu- 
lation of the digits within each date. 


Here's a symbolic illustration of 
what we want: 


MDD 
MMDDYY 


l. Let's start by picking off the month 
and day: 


10000] YYMMDD 
MMDD 


2. Multiplying that by 1000000 (6 digits) 
produces: 


1000000xMMDD 
MMDDO000000 


3. To which we add the original dates: 


YYMMDD+MMDDO000000 
MMDDYYMMDD 


4. Now all we have to do is divide by 
10000 (to eliminate the trailing four 
digits that represent MMDD): 


MMDDYYMMDD+10000 
MMDDYY .MMDD 


5. and floor the result: 


LMMDDYY .MMDD 
MMDDYY 


It could be written in one statement: 
L (YYMMDD+1000000x10000|YYMMDD)+10000 


There's yet a further simplification 
that will eliminate the L, and allow the 
whole calculation to proceed in integer 
representation. The fractional part of 
step (4) is exactly (10000|YYMMDD)+10000. 
Given MD<10000|YYMMDD, the statement above 
becomes: 


L (YYMMDD+1000000xMD)+10000 
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which is equivalent to: 
( (YYMMDD+1000000xMD)+10000)-MD+10000 


because of the fraction in step (4). Now 
distribute the denominator (10000): 


(( YYMMDD+1000000xMD)-MD)#10000 

Reassociate the numerator: 
(YYMMDD+(1000000xMD)-MD)+=10000 

Add common terms (distribute MD): 
(YYMMDD+999999xMD)+=10000 

and expand MD: 
(YYMMDD+999999x10000|YYMMDD)+10000 


This statement requires one-third the 
dynamic storage of its equivalent: 


10016 0 10000 TDATES 
and executes up to three times faster! 
As an exercise, try applying these 
principles to converting dates input in 


MMDDYY form to their YYMMDD representa- 
tion. 


August 1976 
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STRETCHING THE WORKSPACE 


",.. programs execute faster in less 
space". Those are sweet words to APL 
users. They mean functions will consume 
fewer CPU cycles, take less connect time, 
and require less workspace storage. The 
net effect is lower cost and increased 
utilization of the APL workspace. 


That third benefit is important. 
Workspaces hold only a limited amount of 
information. They must accommodate func- 
tions, variables, groups, a symbol table 
(to store object names), execution stack 
information (the state indicator), and 
other internal directories. Moreover, 
during execution of expressions, the 
active workspace is continually allocating 
and freeing storage for temporary values. 


By making efficient use of the free 
working area of the active workspace (mea- 
sured by UWA), the APL*PLUS System reduces 
the incidence of WS FULL errors. The user 
can therefore store more programs or pro- 
cess more data than is otherwise possible, 
thus leading to additional benefits in 
programmer productivity, more efficient 
execution, and reduced internal overhead. 


There are four fundamental, as well 
as several specialized, mechanisms cur- 
rently employed by the APL*PLUS System to 
conserve available workspace storage 
resources. All are invoked automatically, 
and generally have no immediately apparent 
effect on running programs. Without them, 
however, usable working area would be 
drastically reduced, and WS FULL would be 
a recurring nightmare to programmers. 


When executing a function, the APL 
System typically requires space for both 
the argument(s) and the result simultane- 
ously. For example, during execution of 


R+(VEC#0)¢MATL32 3 4] 


the system will, at one instant, require 
space for the temporary value VEC+0, the 
other temporary value MAT[;2 3 4], and the 


result of the compression. Essentially, 


TEMP1<VEC#0 © TEMP2+MATL;2 3 4] 
R«TEMP1/TEMP2 © OERASE 'TEMP1 TEMP2!' 


Of course, VEC and MAT are held in the 
workspace during the entire process. 

Given this general rule, let's see how the 
system tries to conserve space. 


1. In-place storage 


Often the system recognizes that the 
result of an operation fits into the space 
used by one of its temporary arguments, 
and is "smart" enough to use that same 
area to store the result. For instance, 


A«+\1100 
requires space only for 1100; the +\ is 


subsequently stored over that temporary 


-15- 


value. This effect would not take place 
for A«+\A because the argument to scan is 
not a temporary value. 


Other examples of in-place storage 
are: 
R+AC$150) a BOTH FOR 9 AWD [] 
R+101p2+1100 a BOTH FOR p AND + 


Some functions, like reversal and reshape, 
normally make copies if there is room in 
the workspace, and resort to in-place 
storage and slower algorithms only if 
necessary to avoid a WS FULL condition. 


2. Chained variables 


Chained variables can result in 
significant savings when using defined 
functions with named arguments. Most 
APL systems materialize often super- 
fluous (wasteful) copies of named argu- 
ments; the APL*PLUS System defers copying 
until necessary -- sometimes not at all! 


The following function returns an 
ascending grade vector on its numeric 
matrix argument: 


VY R*GRADE MAT;I0O;COL 
C1] TO+''i1 © Rei (pMAT) (IO) 
C2] >+(COL+(pMAT)[IO+1])+¥0  COL+COL-~IO 
C3] 4:R+RCAMATCR;COL)]) © +4xIOsCOL+COL-1 
v 


Given M+? 100 20 p100, executing 

GV+GRADE M_ requires OWA to be slightly 
more than 1400 bytes. The traditional 
function-calling mechanism would have 
required OWA to be more than 9400 bytes to 
avoid a Ws FULL. 


3. Datatype conservation 


Numbers stored in binary, integer, or 
floating-point format require 1, 32, or 64 
bits per number, respectively. Several 
enhancements in the APL*PLUS System deal 
with conserving storage by maintaining 
datatype wherever possible. For example, 


60%#16 © 2*131 0 !112 


all return 4-byte integer results, rather 
than the 8-byte floating-point results 
given by other APL systems. Similarly, 
dyadic x, *, [, L, |, and ! operations on 
Boolean arguments preserve the binary 
datatype. Also, indexed assignment 
attempts to maintain the datatype of the 
target variable whenever possible. For 
example: 


A+100p0 © ALJ+200 a COS OF 0 RADIANS 


preserves A as binary, even though 200 is 
a floating-point 1. 


4. Name referencing 
Several identity operations perform 
no actual data movement in the workspace 


at all. Instead they pass internal point- 
ers to the right argument. This occurs 


=16— 


for both named and temporary arguments. 
For instance, the following expression 
takes almost no workspace storage beyond 
that required for CV: 


CV*+80000p'ABC! 
CV+(pCV)p¥1/,2'cV[]' 
a AA AAA A PASS POINTERS 


A particularly common use is functions 
that ravel arguments which are predomi- 
nantly vectors anyway. 


5. Specialized mechanisms 


Besides the techniques described 
above, several others are used when speci- 
fic syntactic constructions are encoun- 
tered. For instance 


'IT80' OFMT 10000p1 


requires only 1500 bytes for execution, 
even though the "result" would seem to 
reguire 800,000 bytes. Similarly, 


BV/1pBV 


requires space only for the result. In 
the case where BV+ 500000t1, OWA need be 
only 76 bytes to execute the expression, 
rather than the over 2 million bytes that 
might otherwise be necessary. Another 
special case is *+0LC, which requires only 
40 bytes for execution, regardless of the 
size of the state indicator. In fact, 
+OLC may work when OLC alone engenders a 
WS FULL error! 


Of course, a large workspace always 
helps. As recently as 1972, an APL*PLUS 
workspace was limited to 32,000 bytes -- 
now it's over 100,000! Theoretically, we 
could allow even larger workspaces. But 
keeping workspaces reasonably-sized and 
making maximum use of that storage 
increases the throughput of the APL*PLUS 
System, and assures maximum machine utili- 
zation, economical operation, and the good 
response time which users depend on. 


The ability to bring data and func- 
tions in from files, to dynamically 
expunge objects, and to automatically load 
other workspaces also helps the user avoid 
WS FULL errors. And, of course, there's 
always )ERASE! But the automatic mecha- 
nisms used by the APL*PLUS System are the 
silent sentries in stretching storage. 


October 1976 
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BRANCHING AND ITERATION 


This article discusses some guide- 
lines and techniques for branching and 
iteration in APL. Many people have asked 
me, "What's the fastest way to branch?" 

I hope to shed some light on this often- 
maligned subject. 


Guidelines for Branching 


The following three guidelines will 
simplify the maintenance of your code and 
improve its readability. The techniques 
you use are not as important as how con- 
sistently you use them. 


1. Branch to labels, not line numbers. 

If your branch targets are line numbers 
rather than labels, simply inserting or 
deleting lines from a program can become a 
heinous task. Furthermore, minor benefits 
in storage and execution speed are gained 
by using label variables rather than con- 
stants. 


2. Adopt a consistent notation for 
labels. Labels like LOOP, AGAIN, DO, 
MORE, and OOPS seldom add clarity to pro- 
grams, especially large programs. Fur- 
thermore, they give the reader little or 
no clue as to where in the function they 
refer, or whether they are in fact labels. 
Common notations are {£1;12;L3;...} and 
{A4;B;C;...}. More mnemonic conventions 
can help the reader recognize program 
segments (for example, LP1 or ER2). The 
notation or convention that you choose is 
irrelevant as long as you adhere to it. 


3. Adopt consistent techniques for 
branching and iteration. For instance, 
some people favor 


+LABELx 1 condition 
while others use 

+(condition) pLABEL 

Again, it's not so important which 
techniques you use (except for trivial 
differences in execution speed), but that 
you use them consistently. Among my per- 
sonal favorites are: 


+(condition)pLABEL or +(condition)+LABEL 
+(xpvector )pLABEL or +(pvector)+LABEL 


+(conditions)/LABEL or +(A/conditions) pLABEL 


+LABELxcondition or +LABELx~condition 


1. Subroutines to handle branching logic. 

Programs can often be made more read- 
able by using functions with mnemonic 
names for branch calculations. For 
instance, 


O *THISL IF M2C+C+1 
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is certainly more understandable than 
© +(M2C+C+1)t0LC 


or 


iS 


© e20 .M2C+C+1 


You can define the functions IF and 
THISL as: 


V R+A IF B V R«THISL 
Bp ta! R+B/A C1] R+1+1+0L¢ 
Vv v 


IF can also be used for multiple 
conditions. For instance, 


>+(L4,L5) IP “1 1=xA 


will branch to L4 if A is negative, Ls if 
A is positive, and fall through if A is 
zero. 


These functions are especially bene- 
ficial in programs with complex branching 
logic and many transfers of control. How- 
ever, in simple iteration (see below) they 
exact a small but noticeable penalty in 
CPU resources. 


2. Faster techniques for iteration. 


The following techniques, while quite 
fast, generally detract from readability 
and should therefore be used judiciously. 


Typically, loops involve a leading 
test as exemplified by ITERATE. 


VY ITERATE M;C 

{1] SETUP O C+1 

[2] +0 IF C>M © PROCESS 0 C+C+1 © +0LC 
v 


We can produce a speedier version of 
ITERATE by eliminating the IF subroutine 
and using a label. 


V ITERATE2 M;C 

C1] SETUP 9 C+1 

[2] A:*+(C>M)p0 > PROCESS > C+C+1 O% +4 
v 


By employing a trailing test, the 
branch overhead can be reduced signifi- 
cantly. The once-executed leading test, 
+M+0, accommodates the zero case. 


VY ITERATE3 M;C 

Bp SETUP > *M+0 O C¥1 

[2] 4:PROCESS © +(M2C+C+1)pA 
v 


Further, by precalculating all branch 
points, we can avoid the serial iteration- 
by-iteration testing. APL permits (and 
even encourages) parallel processing on 
data -- why not parallel calculations, 
testing, and branch-point determination? 
ITERATE4 illustrates a simplified form of 
this technique. 


== 


VY ITERATEY M;C3L 
[1] SETUP © C+1 © +L<(MpA),0 
[2] A:PROCESS © +L[C+C+1} 

Vy 


Note that all of the above algorithms 
correctly handle the case where zero iter- 
ations are requested. They are also re- 
Startable; that is, if PROCESS produces a 
WS FULL or VALUE ERROR, the problem can be 
corrected, and *ULC will resume execution 
normally. It is true that ITERATE2 could 
have been shortened to 


[1] SETUP & C+«0 
C2] A:+(M<C<C+1)p0 © PROCESS © +A 


However, resumption after a VALUE ERROR on 
PROCESS would have incremented C twice 
(once before the error, and once after the 
restart), thereby skipping one iteration. 


The following table illustrates the 
relative CPU timings of the above tech- 
niques with varying numbers of iterations. 
The ratios are to ITERATE4; SETUP and 
PROCESS were set aS empty matrices. 

Iterations (M) 


eee weee. cage, eee 2o0e 
ITERATE tT AeeG: (NT S2b0\.- 9289 
ITERATE2 .90 1.08 1.45 1.67 1.68 
ITERATE $65 9 SO- EOS! ANG) -d.lt 
ITERATE 1200) 100: 2.00 “4.00 “2.208 


(Executed 11/29/76 at 7:27 P.M. EST 
on APLPLUSC with 20 users) 


3. & Recursed! 


Did you know that you can use the 
execute function (2) to effect iteration 
in desk calculator mode? Assume M is the 
number of iterations (M21), and C is the 
counter (initialized as C+1). Now compare 
the following two lines -- one in a func- 
tion, the other in desk calculator mode: 


(W¥] lL: PROCESS © +(M2C«C+1)/L 
2L<'PROCESS © #(M2C<C+1)/L' 


Both have the effect of executing PROCESS 
as many times as called for by M. Observe 
that whereas the first line uses branching 
to the label L, the second uses ¢£ to 
recursively execute the Statement repre- 
sented by ZL. Unpleasantly, the execute 
construction consumes significantly more 
CPU time and is far more extravagant in 
its use of workspace storage. It does, 
however, illustrate the interesting rela- 
tionship between > and £. 


January 1977 
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STRING SEARCHING 


The ability to rapidly locate string 
occurrences in other strings is often 
useful. The many techniques applicable to 
the problem also provide enlightening 
examples of efficient problem-solving in 
APL. This article discusses several 
algorithms for string searching to illus- 
trate efficiency of execution and some 
interesting techniques. 


A string is a vector, typically 
character. String searching means locat- 
ing the occurrences of one string, called 
the substring, in another string, called 
the target string. The result can be a 
Boolean vector of the same length as the 
target string with 1's denoting the 
leading element of each match. More 
commonly (see each function below), the 
result is the origin-sensitive indices of 
the 1's in the Boolean vector (i.e., 
bV/.1pBV). Overlapping matches are possi- 
ble. In all functions, the left argument 
A is the target string and the right 
argument & is the substring for which we 
will search. 


The most fundamental APL solution is 
quite straightforward: 


VY RA SS1 B3C3D38 
C1] C«pA+,A > DepB+,B > E+O0fC-0fF D-1 
[2] Re (Bta/((1D)-11)04°.=B)/1E 
[3] a OR ALTERNATIVELY, 
C4] aR<(BEta4((1D)-11)0B°.=4)/1F 
Vv 


All characters in the target string are 
compared with all characters in the 
Substring, the logical matrix is shifted 
to align each potential occurrence, and 
the indices of complete matches are 
selected. Notice that if the length of 
the substring B is greater than that of 
the target string A, then no matches are 
possible. Furthermore, if the substring 
is empty, all indices are returned. The 
number of potential matches, EF, is nor- 
mally 1 plus the difference in lengths of 
the strings, that is, &+1+C-D or E«C-D-1. 
The two Of 's are to correctly handle empty 
or overlong substrings. 


A somewhat more elegant solution 
employs inner product to detect matches: 


V RA SS2 B;C3D3E 
(1) C+pA+,A © D+pB+,B © B<0fC-0fD-1 
[2] R«(EtBa.=(D,C+1)pA)/i& 
[3] a OR ALTERNATIVELY, 
C4] eaR«CEtBa.=((1D)-121)O0(D.C)pA)/1E 
Vv 


Rather than rotating a logical array, this 
technique builds a matrix of all possible 
matches and then uses A.= to select those 
that are complete. Observe the pleasant 
effect of the reshape on line [2] -- it 
both builds and shifts the matrix. S52 is 
somewhat faster than SS1, although it 
requires more workspace area (UWA) in 
which to execute. 
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Unfortunately, both of these algo- 
rithms are inherently inefficient because 
they must compare all possible combina- 
tions of characters in the two strings; 
that is, they are "blind", and continue to 
process even if no matches are possible. 

A somewhat better approach is to determine 
the indices of the first character, and 
compare only the remaining candidates with 
the rest of the string: 


V RA SS3 B3C3D38 
[1] C+pA+,A © D*epB+,B > +BLAtD 
[2] A:R«iC > +0 A EMPTY STRING 
[3] R«(Ae€B)/1C © +0 a SINGLETON STRING 
C4] B:8&+0C-D-1 © + (pR«(Etde1tB)/i1F)+0 
[5] Re (ALRo.+( 1*11)41D]A.=14B)/R 

Vv 


This function employs some new techniques 
worth discussing. Notice the unusual 
branch on line [1] which has the effect of 
trapping both the empty and one-element 
cases. If desired, line [2] could then be 
modified to return 10 in the empty case 
rather than all indices -- a more usable 
result. 


Line [4] calculates the number of 
potential matches, and locates the occur- 
rences of the first character. (For 
character arguments, « is typically faster 
than = because of the internal algorithm 
used.) If the first character is not 
found, the function exits immediately with 
an empty result. Otherwise, a matrix of 
all remaining candidates is built via 
outer product and subscription, and A.= is 
used to produce the compression vector 
which selects only complete matches. The 
(“1*11)41D construct cleverly accounts for 
the index origin while removing the index 
of the character already examined. 


While S5S3 is a considerable improve- 
ment over SS1 and SS2, it is still rela- 
tively naive. For instance, if the first 
character of the substring occurs fre- 
quently in the target string, little or no 
savings may accrue. Furthermore, the risk 
of a WS FULL is increased because of the 
larger space required by the matrix of 
indices. A more sophisticated algorithm 
employs basically the same technique, but 
attempts to minimize the number of initial 
“hits" by searching for the character in 
the substring that has the least number of 
expected occurrences: 


V RA SSH B3C3D3E3F3G 
C1] C+pA+,A © D+pB+,B O +BLAtD 
[2] A:R«iC 0 70 aA EMPTY STRING 
[3] R«(AeB)/1C © +0 a SINGLETON STRING 
C4] B:F+«' ETAISONRLCFHUDMPBWYGKVJQXZ.,' 
[5] E«O[lC-D-1 © GeFiB © F*Gil/G 
[6] +(pR+(#£+(F-11)+AeBLFI])/1£)+0 
{7 ] G+ (F*1D)/1D 
[38] R+(ACRe.+G-11]a.=BLG])/R& 
Vv 


Line (4] specifies a letter distribution 
vector arranged in descending order of 
expected frequency in normal text. By 
line [6], F has become the index in the 
substring of the letter with the fewest 
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probable occurrences in the target string. 
This letter is used for the first compari- 
son. Its index is then removed on line 
[7] for the outer product calculation of 
indices and A.= comparison on line [8}. 


Since we have concluded that reducing 
the number of potential hits on one 
character is beneficial, why not do it on 
two or three? In fact, why not conclude 
the entire process by constantly refining 
the potential matches until none remain, 
Or until we have exhausted the substring? 
SSS does just that: 


V AA SSS B3C3D3k 
[1] C+pA+,A © DepBb+,B > +BLAt+D 
[2] A:R«1iC © +0 A EMPTY STRING 
[3] h+(Ae€B)/1iC © +0 a SINGLETON STRING 
C4] B:E*O[C-D-1 9 +(pR+(EtAe1t+h)/1E) 40 
[5] B+ (1C0+1)+8 
C6] C:+(pR+(ALR+CJ]eBL[C]1)/R) +0 
[7] +CxD>C+C+1 
Vv 


Through line [4] the function is 
identical to S53; that is, both empty and 
one-element strings are accommodated, and 
the first character of longer Strings has 
been compared. Line [5] initializes the 
substring element counter, and modifies B 
so that the counter is appropriate in 
either origin. Lines [6/ and [7] compose 
the loop; on line [6] the Cth character 
beyond existing matches in the target 
string is compared with the Cth character 
of the remaining substring. If no matches 
remain, the function terminates. Other- 
wise, the counter on line [7] is incre- 
mented and the function branches back to 
line [6] if there are more characters to 
compare in the substring. 


In most cases, SS5 is highly effi- 
cient. It slows down somewhat if the 
substring is quite long because it must 
iterate on each element. On the other 
hand, its antecedents are all dangerously 
prone to WS FULL problems in similar 
situations, especially if the target 
string is also long. However, WS FULL 
errors are still possible in SS5, particu- 
larly if the first character of the 
substring occurs often in the target 
string. This situation can be remedied in 
much the same way as was illustrated in 
the progression from SS3 to SS4: 


V RA SS6 B3C3D3E3F3G 
[1] C+pA+,A > D+pB+,B © +BLAtD 
[2] A:Rkei€ > 70 a EMPTY STRING 
[3] R«(AeB8)/iC O +0 a SINGLETON STRING 
C4] B:F+' ETATSONRLCFHUDMPBWYGKVJQXZ.,' 
[5] E+OfC-D-1 © B+B(G+VFiB] 
[6] G+G-F+*1C+l 
[7] >+(pk+(EtGLFI]+4eB(F1])/1F)+0 
[83 G+F+G © BeF+B 
C9] C:+(pR+(ACR+G([CO]JeB(C])/R)+0 
[10] +C€xD>C+«C+1 
Vv 


An expected letter distribution vector is 
used to rearrange the elements of the 
substring so that the fewest possible 
Matches are likely for each iteration. 
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Besides reducing the expected storage 
requirements, this technique normally 
speeds up the search further because fewer 
comparisons need be performed. 


Some conclusions can be drawn from 
the above discussion: 


° the shortest solution is not necessar- 
ily the fastest; 


° handling special cases as such is 
often beneficial; 


° iteration can be faster than "closed- 
form" code; and 


e intelligent analysis of an algorithm 
can yield substantial gains in CRU 
efficiency and workspace conservation. 


In case you're dubious, the following 
timing comparisons should dispel any 
doubts. The target string is the variable 
INSTRUCTIONS (3683 characters) from 
workspace 1 FILEAID on the APL*PLUS 
System. The times are in CPU millisec- 
onds; the origin is 1. 


B pk 

SUBSTRING HITS $51 352 $53 S54 S55 556 
me 3683 19 19 2 2 2 2 
st 783 85 60 8 8 8 8 
"w! 0 82 56 2 4 2 2 
: ‘ 342 164 105 27 28 14 14 
'AN' 33 163 104 10 9 7 7 
‘aw! 0 162 103 3 4 3 4 
 PHE® 20 262 204 55 9 14 8 
'THE * 16 261 203 14 3 93 9 
"MATRIX' 5 -357 300 15 5 10 9 
"RATMIX' 0 356 301 23 5 9 5 
' PERMITS' 15 456 403 113 #17 «18 13 
"PERMITS ' 15 455 401 16 17 12 13 
"NOTFOUND' 0 454 399 22 19 7 7 
‘ae10Uup~w' 0 455 399 3 4 4 4 


(Executed 2/25/77 at 9:19 P.M. EST 
on APLPLUSC/470-10011 with 8 users) 
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TIMING ALGORITHMS 


APL is a lovely notation for express- 
ing algorithms. Its rich set of primi- 
tives provides a wealth of identities -- 
alternative ways of formulating a problem. 
In a theoretical or pedagogical environ- 
ment, these formal equivalences allow us 
to write expressions and derivations in 
the manner that seems most appropriate. 
This use of APL as a mathematical notation 
preceded its implementation as a computer 
language by several years. 


When APL was implemented in 1966, one 
could, by simply assigning values to 
variables and entering an expression, have 
a computer evaluate it and return the 
result. Henceforth, rather than being an 
abstract notation, APL became widely known 
as a computer language. In such a practi- 
cal environment, users soon discovered 
that alternative formulations of the same 
problem often took widely disparate 
amounts of computer time to execute. 

Since computer time costs money, users had 
a definite interest in knowing the rela- 
tive costs of alternative algorithms. And 
so were born timing programs. 


A timing program simply measures the 
time required to perform an algorithm. 
The elapsed time, or the CPU time, or both 
can be measured. However, the elapsed 
time is usually ignored for two reasons: 
(1) it varies due to unpredictable exter- 
nal influences (such as what else the 
computer is doing at the moment), and (2) 
its cost is typically insignificant as 
compared to CPU time. On most large-scale 
computers today, the cost of connect time 
is typically less than one percent of 
equivalent CPU time. 


Thus, it is CPU time we wish to 
measure, and the system function JAI 
(accounting information) is our tool. Its 
second element is the CPU time accumulated 
since sign-on, which is measured in 
thousandths of a second (milliseconds). 

If we wish to measure the CPU time used to 
perform an algorithm, we first record the 
time, then execute the algorithm, then 
note the time again and subtract. 


V+?40p1000 © M+? 40 4O 9100 


NAIC2) 
27 

R+VEM 

QAIC2) 
164 

164-27 
137 


Thus, it took less than one-seventh of a 
second to solve a set of 40 linear equa- 
tions with 40 unknowns. 


Alternatively, to avoid reentering 
desk calculator mode twice (which itself 
takes a small amount of computer time) we 
could perform: 
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CPU+UAIL2] © R«VEM © OAIL21-CPU 


131 


Two problems are inherent in all 
measurements: (1) the accuracy of the 
measuring device, and (2) the impact of 
the measuring device on the process it is 
measuring. Observe that if we repeat the 
test above, the time may differ slightly. 


CPU<UAICL2] © R«VEM © OAIL2)-CPU 


133 


This phenomenon is normal and reflects 
imprecise timing resolution within the 
computer. We can improve the statistical 
reliability of our meaSurement by perform- 
ing the algorithm several times, and 
dividing the total time by the number of 
iterations. 


¥Y CPU+FOR N;TIMES 
[1] CPU<DAI © TIMES<1 
[2] A:+(TIMES>N)pB 
[3] R+«VEM 
C4] TIMES+TIMES+1 © +A 
(5] B:CPU+(QAI-CPU)(21+N 
v 


FOR 10 
131.4 

FOR 10 
131.2 


If the algorithm takes a very small 
amount of computer time, as in the follow- 
ing example, 


CPU<QAIL2] © R«99+11300 © OAIL2]-CPU 


CPU<QAI(L2)] © R+994+11300 © DAIL2]-CPU 
1 


the coarse resolution of the timer makes 
it impossible to obtain reliable measure- 
ments. Thus it is essential to repeat the 
algorithm many times just to accumulate a 
meaSurable amount of time. 


VFORL3] R<«99+113009 


FOR 100 

1.84 
VFOR[3] R<99+11300V9 
FOR 100 

1.84 


While FOR improves the accuracy of 
our test, it imposes a measurable overhead 
of its own, that of the testing and 
looping time. If we remove line [3] and 
rerun FOR, 


VFOR[~3]V 
FOR 100 
0.32 


we discover that the timing algorithm 
itself uses one-third of a millisecond per 
iteration. Thus, we should subtract this 
overhead time from the result. This tech- 
nique is used in workspace 303 RASMARK, 


A better and more general approach is 
illustrated in the following algorithm. 


96 = 


V R«A TIMES B3C3D;RUN 
[1] acCPU FOR A ITERATIONS OF EXPRESSION B 
[2] C+'O'*LB O C14 (Axpl)pc 
[3] C+ODEF 'VRUN' ,OTCWL,'[1]',c,'V! 
C4] R+OAI © RUN © R<OAI-B 
[5] C<ODEFL 'RUNC1]',1+4Ap'9O! 
[6] D+OAI © RUN © D<OAI-D 
[7] R<«(R-D)(2]1,4 

Vv 


TIMES accurately isolates the time 
required to execute the given algorithm. 
For instance, 


O<T<10 TIMES 'R<VEM' 
1306 10 

+/T a PER ITERATION 
130.6 

+\100 TIMES 'R<«99+11300' 
147 1.47 


Line [2] replicates the expression 4 
times, inserting diamonds between each 
instance. This vector is converted into a 
function, and its execution time measured 
on line [4]. Lines [5] and [6] measure 
the overhead cost of the diamond statement 
separator, and the last line subtracts 
this overhead from the total time and 
catenates the number of iterations. 


TIMES will help you evaluate alterna- 
tive formulations of a problem. Here is 
an example of a typical test (finding the 
unique values in a numeric vector): 


ALG1+«'R<«((A1A)=1pA)/A' 
ALG2«'R+ALAA] © Re(1,(14R) 24° 14R)/R' 


A+«?7920 

#\50 TIMES ALG1 © +\50 TIMES ALG2 
20 0.4 
38 0.76 


A+«?70p200 

#\50 TIMES ALG1 © +\50 TIMES ALG2 
115 2.3 
115 2.3 


A+?700p2000 

+\5 TIMES ALG1 © #\5 TIMES ALG2 
810 162 
115 23 


The first algorithm is faster for 
small vectors, but the second is substan- 
tially faster for large vectors. 


May 1977 
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USING BOOLEANS TO CONTROL COMPUTATION 


One of APL's unique contributions to 
computing languages is its ability to 
generate and manipulate Boolean arrays 
{those containing only 1's and O's). 
Although binary {or bit) datatype is 
available in many other languages, it is 
often clumsy to use, requiring conversion 
and indirect manipulation. APL, on the 
other hand, handles Boolean data directly 
and efficiently; the data is stored 
compactly (one value per bit) and pro- 
cessed rapidly. 


Boolean values are most often gener- 
ated by one of the six relational func- 
tions (< £ = 2 > #) or by epsilon (e€). 
Five other functions perform logical 
calculations: and (4), or (Vv), nand (*), 
nor (¥), not (~). Together, these provide 
all the nontrivial functions of logical 
calculus. For instance, exclusive-or is # 
and logical implication is s. 


But APL's strength with Boolean 
values lies in its ability to use them 
with every primitive function that allows 
numeric arguments. Some functions, such 
as compression (/) and expansion (\), are 
designed specifically for Boolean argu- 
ments; these allow us to select from, and 
insert values into, arrays. Other func- 
tions for which Boolean arguments are most 
useful are rotate (%), grade up (4), grade 
down (¥), decode (1), and several scalar 
dyadic functions (+ - * + | !). The 
reduction and scan operators also gain new 
Significance with Boolean arguments. 


Most uses of Boolean values deal with 
representing logical true-false condi- 
tions. For instance, 


V+?14p14 O V,[0.5] V>8 
8 711 5 912 8 3 6 114% 5 7 13 
o 0 10601 1 0060 00 1 0 060 1 


V>8 generates a Boolean vector (or bit 
vector) that asserts whether the corre- 
sponding element of VY is greater than 8; 
the result contains a 1 if the assertion 
is true and a 0 if it is false. We can 
use this result in a variety of ways: 


+/V>8 How many greater than 8? 
5 

A/V>8 All greater than 8? 
0 

v/V>8 Any greater than 8? 
1 

<\V>8 The first greater than 8. 
0010000000000 0 

(V>8)i1 What is its index? 
3 

(¥>8)/V All elements greater 
11 9 12 14 13 than 8. 


(V>8)/i1pV Indices of all ele- 
35 6 11 14 ments greater than 8. 


The entry VE¥V>8) moves all elements 
greater than 8 to the front of the vector: 
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VOVV>8] 
119121413875 83615 7 


Boolean values can also represent 
values in binary representation. The 
following expression gives the base-2 
representation of 165 in 8 bits: 

2 22522. 2-2 2 T4465 
1012003101 
We can convert back to the base-10 value: 


2210100101 
165 


With some imagination, we can also 
use Boolean values to control other 
calculations. A common operation is 
multiplication by a bit vector. For 
instance, assume we have a matrix REGS of 
sales region names and an associated 
vector MACC indicating how many new 
accounts each region has generated. We'd 
like to produce a new-account report. 


DISP+NACC>O Show only regions 
with new accounts. 
NA+NACCXNACC>1 Show the number only 


if it's more than 1. 


DISP/REGS,'PO (MQU)OBLI6' OFMT NA 
NEW ENGLAND (3) 
MIDDLE ATLANTIC (10) 
EAST NORTH CENTRAL 
BAST SOUTH CENTRAL (5) 
WEST SOUTH CENTRAL 
PACIFIC (2) 


(~DISP)#REGS 
SOUTH ATLANTIC 
WEST NORTH CENTRAL 
MOUNTAIN 


By using logical multiplication, we 
"zeroed out" WA for regions with only one 
new account, and used the B modifier to 
Suppress display. 


Another application is to return 0 
rather than 1 for 00: 


A+ 6705305007 
B+ 3402304 5 8 2 
AtB © At+BxBe0 
21.75 12.5 111.25 0 0 3.5 
21.75 0 2.5 101.25 0 0 3.5 


Here we used exponentiation by a logical 
vector to change 0's into 1's because 01 
will produce the desired 0. We could also 
have used A+#B+B=0 in this case. 


The following table (the first two 
rows were illustrated above) shows the 
most useful manipulations and what they 
produce when BOOLEAN is 0 or 1. 


Operation 
AEBOOLEAT 
A* BOOLEAN 
BOOLEAN!A 
BOOLEAN|A 
Ax~BOOLEAN 
A*~BOOLEAN 


(#) Only for integer 4. 


BOOLEAN+0 BOOLEAN+1 


(#) 


mnhmne HO 
POOhR » 
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Often when dealing with arrays, we 
wish to process part of the array while 
leaving the rest untouched. For instance, 
add 3 percent to only those bills over 30 
days old: 


BILLS*+BILLS*1.03*DAYSOLD>30 


Other times, we may wish to alter the 
order of the arguments of the computation 
itself based on a conditional array (e.g., 
A-B or B-A). For functions with an 
inverse, we may wish to conditionally 
alter the function used in a computation 
(e.g., A+B or A-B). If a function has an 


inverse, we can use 1 and one of x*|! on 
the conditional array to give us the 
proper answer. Or, as shown above, we can 


use the conditional array to control 
whether the operation is done or not. In 
the table below, C is the conditional 
array. C controls the function used in 
the first two lines, the order of the 
function's arguments in the next two 
lines, and whether or not the function is 
performed in the last three lines. 


If C=0 If c=1 
Return Return By Using This Expression 


A+B A-B A+B x 1*C 
AXB AtB AxB * 1*C 
A-B B-A (A-B)x°1*C 
AtB BtA (AtB)* 1%C 
A A+B A+BxC 
A AXB AXB*C 
A A*B A*B*C 


A more general way to interchange 
scalar function arguments conditionally is 
by reducing a two-row array that has been 
rotated by a conditional vector. For 
example, the following performs A*B if C=0 
and B*A if C=1: 


A,C1) B,{0.5] ¢ 
5 171 9 2 5 12 2 7 3 38 
1 61 8 14 4 227 5 6 2 4 4 


o 101 01 01 0 1 0 1 «0 
(1,pA)p*#COA,[0.5] B 
5 6 7 8 9 16 25 27 32 36 49 64 81 


Booleans can also be used to select 
one of two values via subscription. (Note 
that if the index origin is 0, the Boolean 
values can serve directly as indices, 
obviating the need for the +1.) 


Be 0101110013100 
7 110B+1) (or 7+Bx4 or 7+B\4) 
7 1417411 41141774111177 


This capability allows simple plotting: 


O+FREQ+8?42 

10 24 15 21 35 31 5 18 

' O'C1+FREQ°.211 /FREQ] 
OO00000000 
O0000000000000000 
OOO000000 
OOO0000000000000 
oo00000000000000000000000000o 


goo00000000 
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Another way to use Booleans is to 
control the merge order of two vectors: 


VOWELS+'AEIOUY' 
CONS+'FCTSL' 

MERGE+1 01010003110 
(pVOWELS)=+/~MERGE 


1 

(pCONS)=+/MERGE 
1 

(VOWELS ,CONS)CAAMERGE J 
FACETIOUSLY 


MERGE controls the selection of data from 
VOWELS (MERGE=0) or CONS (MERGE=1). 


Booleans can be used to control 
computations in many other ways. Bob 
Smith, known as the "Boolean Bomber" at 
Scientific Time Sharing Corporation, has 
done significant work in this area. His 
paper, "Boolean Functions and Techniques" 
is available from STSC as Working Memo- 
randum No. 106. 


July 1977 
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APL POTPOURRI 


(Note: All examples are in origin 1.) 


Did you know that ... ? 


''5A selects the first element of A 
as a Scalar. What happens if A is empty? 


3 5+SCALAR is acceptable, and 
creates a matrix with all zeros or blanks 
except for position [1;1] which is the 
Same value as SCALAR. 


(Np0)+SCALAR has the same effect as 
(Np1)pSCALAR or (Np1)+SCALAR -- it creates 
a Singleton of rank NW. 


A scalar is allowed as the right 
argument to compression and expansion. In 
each case, the scalar is first replicated 
to the appropriate length. Thus, B/SCALAR 
is identical to (+/B)pSCALAR. 


A+1/A changes A into a one-element 
vector if A was a scalar, but leaves all 
other arrays unchanged. 


O=1+0pA returns 1 if A is numeric, 
0 if A is character. 


A/,B=B=1 returns 1 if B is Boolean 
valued, 0 otherwise. A/,Be0 1 can also 
be used but is somewhat slower. 


You can demote a numeric array RF to 
its most compact internal representation 
with the following expression: 


A«pR © L+(x/A)p0 © LL IJ+,R © R<ApL 


Note particularly the third statement, a 
syntactic construction not often seen in 
APL programs. 


Applying reversal (${K]A) or rotation 
(BOCKI]A) along a nonexistent coordinate 
(that is, where ~KeippA) is acceptable and 
has no effect. This can be useful. For 
instance, VIO[K]¥V] sorts V in ascending 
order if K is 1, and in descending order 
otherwise. 


The expression +/[£2] 3 04 p5 
returns a 3 by 4 matrix of zeros, thereby 
actually increasing the storage require- 
ments. Can you guess what L/ 2 3 0 p6 
returns, and why? 


Relational scans (<\ <\ =\ 2\ >\ #\) 
do not always return Boolean values. For 
example, 


>\ 0.5 0.5 0.5 
0.501 


Any outer product can be rewritten as 
an inner product. For instance, Ae.+B can 
be restated as (((pA),1)p4)0.+(1,pB)pB 
where Oo can be any Scalar dyadic function. 
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Any reduction on numeric data can 
likewise be written as an inner product. 
For instance, x/[K]A can be restated as 
Ox. +(AAKZ#1p9A) QA. 


Base value (1) can be written as a 
+.x Matrix product. Thus 41B (without 
scalar or unitary coordinate extension on 
the left argument) can be restated as 


(Ox\ (pA) +1, 64) +.B. 


A+A is faster than 4x2, and AxA is 
faster than 4*2. Generally the simplest 
function is fastest; but the speed differ- 
ence is small, so use the most natural 
construct. 


The expression (1*pA)&A Selects the 
major diagonal of any array. 


'TO' or 'A0' or 'GMHM' can be used 
with OFMT to suppress display of unwanted 
columns. For example: 


'T0,22,A40,274,3¢CMM,F8.3' OFMT 4 8pi32 
2 4 8.000 
10 12 16.000 
18 20 24.000 
26 28 32.000 


Many experienced APL programmers 
cannot name all APL primitive scalar 


dyadic functions. Can you? (Hint: + is 
one of them, and there are twenty more.) 


September 1977 
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DATA SELECTION 


Selecting data from an array is one 
of the most common operations in APL. 
There are three primary facilities for APL 
data selection: 


e positional, using subscription (A[2B]) 
e conditional, using compression (B/[K]A) 
e bounded, using take (BtA) or drop (B+A) 


There are also several specialized 
techniques for frontal and diagonal 
selection using reshape (BpA) or transpose 
(B84). Besides selecting data, some of 
these functions can replicate data (sub- 
scription, compression of a scalar, and 
reshape), extend data (take), or reorgan- 
ize data (subscription, reshape, and 
transpose). This article discusses these 
techniques as applied to the matrix A in 
origin 1: 


OLO*1 & O*A*(10x15)¢.4+19 
1161213 14 15 16 17 18 19 
21 22 23 24 25 26 27 28 29 
31 32 33 34 35 36 37 38 39 
41 42 43 4H 45 46 47 4B 4G 
51 52 53 54 55 56 57 58 59 


Subscription is the most generalized 
selection function. It can be used to 
select, reorder, and replicate data. 


AC2 51391575 5] 
29 21 25 27 25 25 
59 51 55 57 55 55 
19 11 15 17 15 15 


Because of its flexibility, subscrip- 
tion is typically the slowest of all 
selection functions applied to a matrix. 
However, it alone can reorder or selec- 
tively replicate data, and it can be used 
to the left of assignment (+). In the 
absence of these requirements, subscrip- 
tion is best used only to select single 
rows or columns, or small subarrays. 
Subscription is somewhat faster on vectors 
than it is on matrices. 


Compression is less flexible than 
subscription; it can select only along a 
single coordinate, and it cannot replicate 
Or reorder data. However, it can select 
non-contiguous rows or columns in the same 
order they occur in the array. 


11001 4A 
11°12 13 14 15 16 17 18 19 
21 22 23 24 25 26 27 28 29 
51 52 53 54 55 56 57 58 59 


100010101 /A 
11°15 17 19 
21 25 27 29 
31 35 37 39 
41 45 47 49 
51 55 57 59 


Because of its more limited capabil- 
ity, compression is typically faster than 
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subscription. Of course, it is ideal when 
selecting on the basis of a Boolean vector 
from epsilon (e) or one of the relational 
(< < = 2 > #) or logical (A vn » ~) 
functions. As noted, compression can 
select along only one coordinate. Thus 
the matrix must be compressed twice in 
order to select both rows and columns. 


0110070001111 00 /A 
24 25 26 27 
34 35 36 37 


If indices are already provided for 
one coordinate, it is usually faster to 
use B/1pB and subscription. For instance, 


ALB/1pB; 5 4 7 6) 
should be used rather than 


BAAL3; 5 4 7 6) 


Take and drop are the most restric- 
tive selection functions. They allow only 
contiguous blocks of data to be selected, 
and they cannot replicate or reorder. 

They are quite fast, however, at selecting 
large blocks of data. One application of 
either function is sufficient to select 
any corner sSubarray. 


3 54+A a 2 444A 3. 5tA a 2 YA 
1112 13 14 15 15 16 17 18 19 
21 22 23 24 25 25 26 27 28 29 
31 32 33 34 35 35 36 37 38 39 

“3 5tA a 2 4A “3 "5tA a 2 4A 
31 32 33 34 35 35 36 37 38 39 
41 42 43 44 45 45 46 47 48 49 
51 52 53 54 55 55 56 57 58 59 


Any contiguous block of data can be 
selected by two applications of either 
function. The subarray 


A4(2 33; 4567) 
24 25 26 27 
34 35 36 37 


can be selected by each of 16 expressions: 


27-4 + 3 #7 4A 2 4+ 3 °6 +A 
“2 “h + ~2 "2 +A “2 4 4 °2 3 4A 
1 3+ 3 £7 +A 1°2 + 3 °6 +4 
1 3+ °2 °2 4A 1°24 ° 2 3 +A 
274 + “kh 7 4A 2 4 +H 6 +A 
24 + 1 °2 4A 2 4+ #1 3 -+4A 
“2 34 “4 7 4A “2 "2 + “4 “6 +A 
“2 34 #1 °2 4A “2 “2 4+ 1 3 4A 


Unless extension (padding with zeros or 
blanks) is needed, convenience should 
determine whether to use take or drop. 


Reshape and transpose are normally 
considered structural functions, but are 
often used for specialized selection. 
Reshape can select data from the front of 
an array. For instance, the following 
expression selects the first element as a 
scalar: 
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ttoA 
11 


The next expression selects the first row 
as a vector: 


SpA 
41.12 13 14 15 16 17 18 193 


Transpose can select the major 
diagonal, or the other diagonals if used 
with reversal. 


1 1 &A 
11 22 33 44 55 
1 1 QoA 
19 28 37 46 55 
1 1 eA 
51 42 33 24 15 
1 1 &de4 
59 48 37 26 15 


Both reshape and transpose are very fast. 


In general, the speed of all selec- 
tion functions depends on the amount of 
data being selected, not on the size of 
the original array. As more data is 
selected, more CRUs will be consumed. By 
avoiding superfluous data selection and by 
using the most specialized function appro- 
priate to your needs, you can increase the 
efficiency of your programs. 


November 1977 
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WHY NOT TO LOOP 


Most APL systems are implemented as 
interpreters. This allows users great 
flexibility in creating and modifying 
Programs. There is no need for compila- 
tion, and errors can be diagnosed and 
corrected during execution. 


The major penalty of an interpreter 
is the cost of syntax analysis and dynamic 
data checking. Fortunately, APL functions 
tend to be quite short relative to pro- 
grams in other languages, and therefore 
the interpretive overhead is minimized. 


On the other hand, when looping is 
performed in APL, the interpretive over- 
head skyrockets because the same code must 
be reanalyzed for each iteration. The 
general rule, therefore, is to try to 
minimize looping in APL. 


As anyone familiar with APL will 
attest, most programs can be written with 
no loops at all. APL's rich set of 
primitive functions and operators allows 
"closed form" (nonlooping) solutions to 
Many complex problems. The control 
structures that must be written explicitly 
in other languages (e.g., DO loops) are 
performed implicitly in APL primitives 
(e.g., Scan). But difficulties can arise 
when people who have been schooled in 
languages like FORTRAN, BASIC, and COBOL 
try to move their looping programs intact 
into APL. 


For example, the following portion of 
a FORTRAN program calculates cash flow for 
a series of 360 investments using variable 
interest rates. DEP is N+l deposits, the 
first being the initial amount; RATE is N 
interest rates between 0 and 1; CF is the 
result. 


DIMENSION CF (361) 
DIMENSION DEP (361) 
DIMENSION RATE (360) 


° 


CF (1)=DEP (1) 
DO 20 I=1,360 
20 CF(I+1)=DEP (I+1)+CF(I)* (RATE (I)+1.) 


This segment can be translated almost 
directly into the following APL program: 


V CF*RATE CASHFLOW DEP;I;N 
C1] CF+DEP 
[2] N«+pRATE 
[3] >N+0 © I*1 
Cal £20:CF(I+1)«<DEPLI+1IJ+CF(II]xRATECII]+1 
(5] +L20xN2Ie«I+1 
Vv 


and will run successfully, albeit slowly: 
(pDEP),pRATE a 30-YEAR MONTHLY LOAN 
361 360 


T+OAI © CF*RATE CASHFLOW DEP © QDAI-T 
0 2070 483 0 (O,MILLICRUS MILLISECONDS ,0) 
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More than 2 computer resource units are 
required to run CASHFLOW -- not a very 
cost-effective solution! 


Rather than entirely rethinking the 
problem, some users will take great pains 
to optimize the existing algorithm. This 
approach is fine if no alternative algo- 
rithms are considered, but it often 
results in a more obscure program: 


V CF*R CASHFLO2 D3I3d3N3;T 

{1] T+D(I+1] © D*14+CFr+D 

{2] ReR+1 

C3] +N+((pR)pL),0 

C4] b:7+CR(J+I+1)+D(I]+7TxR(I] © +N[I+d] 
v 


The techniques used in CASHFLO2 all 
serve to reduce interpretive overhead in 
the loop on line [4]. Interest rates R 
and branch targets W are calculated in 
advance, and subscript calculations are 
simplified. (Note that the use of one- 
letter variable names does not affect the 
cost of running the program.) 


T+OAI © CF+RATE CASHFLO2 DEP © QAI-T 
0 1301 330 0 


We've reduced the cost by one third -- a 
significant savings, but still not enough 
to be cost effective. 


By far the most effective approach is 
to reformulate the problem into an APL 
solution, rather than a warmed-over 
FORTRAN solution. Consider the following: 


VY CF+RATE FASTFLOW DEP 
{1] CF+1,x\RATE+1 © CFP*CPx+\DEPtCP 
V 


CF is set first to the cumulative present~— 
value discount factors for future depos- 
its. The deposits are divided by those 
factors, summed, and then multiplied by 
the factors to produce the result. A good 
exercise is to derive this algorithm from 
the looping algorithm; the key is that 
multiplication and division are both 
distributive with respect to addition 
(division is only right distributive). 


FASTFLOW is indeed an elegant and 
inexpensive solution to the problem: 


T+OAI © CF*+RATE FASTFLOW DEP © QAI-T 
0 26 7 0 


It performs 50 times faster than even the 
optimized looping solution, and illus- 


trates the benefits that can be derived 
from writing nonlooping APL code. 


January 1978 
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CHARACTER SEARCHES 


With the recent introduction of the 
system function QSs for character string 
searching, and the major speedups to 
character a.= and v.z, a variety of new 
techniques are now practical. Although 
these techniques were usable before the 
enhancements, they were often infeasible 
because of WS FULL conditions or the large 
number of computer resource units needed 
to accomplish them. Frequently, obscure 
or clumsy code was written to circumvent 
the previous snailish pace of a.=. 


For example, a typical table lookup 
MATRIXA.=VECTOR 

was occasionally coded as 
A/MATRIX=( pMATRIX) pVECTOR 


to reduce cost (at the risk of WS FULL). 
Now, the first expression is many times 
faster. 


The improvements to a.= also dramati- 
cally altered the relative speeds of the 
six different string-searching algorithms 
presented in "String Searching" beginning 
on page 21. In particular, SS2, SS3, and 
SS4 all use a.= for part or all of the 
search. Below is a table comparing their 
performance on 25 February 1977 with their 
current performance and with that of their 
UsS-equivalent (R+«(A OSS B)/1pA). The 
times are in milliseconds. 


B pR 25 FEB 1977 24 FEB 1978 

SUBSTRING HITS $52 $53 S54 552 553 554 055 
rt 3683 19 2 2 8 2 2 6 
teu 783 60 8 8 11 8 8 8 
‘w! 0 56 2 2 6 2 2 2 
§ , 342 105 27 28 10 18 pe | 7 
'AN' 33 104 10 9 7 8 8 2 
‘aw! 0 103 3 4 6 3 4 2 
' THE' 20 204 a5 9 8 24 7 & 
'THE °* 16 203 14 9 7 9 7 3 
'MATRIX' 5 300 15 § 8 9 6 2 
‘RATMIX' 0 307 23 5 8 12 6 2 
' PERMITS'* 15 403 113 17 10 Yi 10 5 
"PERMITS * 15 401 16 17 8 9 g 2 
*NOTFOUND' 0 399 22 19 8 10 10 2 
*age10up~a' 0 399 3 4 8 3 5 1 


(Executed 24 Feb 78 at 6:46 PM EST 
on APLPLUSC/470-10011 with 34 users) 


Although they still suffer a propensity 
for WS FULLS with large arguments, it's 
nice to know their cost has dropped so 

dramatically! 


Pleasantly, the discussion of string- 
search techniques is now moot because we 
have 08S. In fact, even if OSS weren't 
available, the following simple function 
would perform the same task: 
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VY R<A ASS B;C 
C1] RepA O C+p,B 
[2] R<R+(-C)+¥Ba.=(C,R+xR) pA 


V 


The function aSS takes roughly twice as 
long as QSS to perform a given search. 
Moreover, OSS has the advantage of 
requiring no additional or intermediate 
workspace storage. 


Not only does OSS provide cost and 
storage advantages, but it also allows 
novel or simpler approaches to problems. 
Below are a few examples. 


Determine if function DSPELL is currently 
suspended or pendent; that is, whether 
it's in the state indicator. 


FN«'DSPELL' 
OLD v/(((1tpOSI),1+p,FN)t0SI)A.=PN,'C! 
v/(((1+pOSl),7)t0SZ)A.='"DSPELLL 
NEW v/(,' ',O08r)0Sss ' ',Py,'C! 
v/(," ',OSZ)O0SS ' DSPELLC' 


Mark the line-ending carriage returns ina 
function. 


VR+OVR 'PNNAME' © VRCOIO-6-pVRI]<'C' 
OLD (VR=OPCNL)A1oVR='([! 
NEW vr Oss orcwe,'C[! 


And finally, here's a function that 
locates a string in any row of a matrix: 


VY R«M ROWSS V 
[1] R+v/(0,1-p,V)+¥(pM)p(,M) OSS V 
v 


OSs locates all occurrences in the rav- 
elled matrix, and its result is reshaped 
to the shape of the matrix. Then, those 
columns that might identify matches 
spanning more than one row are dropped. 
Finally, the v/ singles out the proper 
rowS. ROWSS will locate matches in a 
character matrix: 


pSTATES 
50 14 
(STATES ROWSS 'NIA')#STATES 
CALIFORNIA 
PENNSYLVANIA 
VIRGINIA 
WEST VIRGINIA 
(STATES ROWSS ‘'LIN')#STATES 
ILLINOIS 
NORTH CAROLINA 
SOUTH CAROLINA 
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BACK TO BASICS 


One of the beauties of APL is the 
ability to get useful results while using 
enly a small subset of the language. 
Gradually, through curiosity and experi- 
mentation, users discover more advanced 
features which they can apply to their 
problems. In the interest of hastening 
that discovery, I'd like to introduce you 
to a powerful function in APL called base 
value. I'1l discuss its performance on 
scalars and vectors only, leaving you to 
experiment with matrices if you wish. 


Base value, also called decode, is 
represented by the 1 symbol (shift B). It 
is dyadic and accepts only numeric argu- 
ments. Let's see what happens in a simple 
case. 


101 829 5 
8295 


The effect of "putting numbers together" 
is actually the result of evaluating the 
polynomial expression 


8x* + 2x7 + 9x + 5 


for x=10, or, for you APL'ers out there: 


X+10 
(8xX*3)+(2xX%2)+(9xX)4+5 
8295 
or 
. S4+Xx9O+Xx24+XxB 
8295 


Notice that the coefficients are on the 
right and the independent variable is on 
the left. Thus, base value evaluates 
Olynomials. This may seem esoteric, but 
1t actually proves quite useful. For 
example, you can use it to combine month, 
day, and year into one number, 


1001 6 5 77 wm (10000x6)+(100%5) +177 
60577 


or to determine the number of seconds in 2 
hours, 10 minutes, and 38 seconds. 


601 2 10 38 Am (3600x2)+(60x10)+1x38 
7838 


Note that the following three expres- 
zlons are equal: 


2+3x4 06 32 4290 41 3 2 
14 
14 
14 


From what you've seen so far, can you 
explain why 11VE£C is the same_as +/VEC, 
and why 01VEC is the same as 1+t+VEC? 


For scalars and vectors, AiB is 
equivalent to +/WxB (or W+.xB), where W is 
a vector of weights derived from A. As 
we've seen above, if the left argument is 
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a scalar, then (assuming QI20+1) W is 
calculated as follows: 


W+ Ax(pB)-1pB 
60* 3 ¥] 92-3 
60* 2 1 0 
3600 60 1 


More generally, if the left argument is a 
vector, 


A+ 0 7 24 60 60 1000 
Be 123 4 5 6 
ALB 

788645006 


base value is evaluating a mixed radix 
polynomial. In the example above we're 
calculating the number of milliseconds in 
l week, 2 days, 3 hours, 4 minutes, 

5 seconds, and 6 milliseconds. In this 
case the left argument represents: 


0 (placeholder) 
7 days per week 
24 hours per day 
60 minutes per hour 
60 seconds per minute 
1000 milliseconds per second 


The weighting vector is computed by 
performing a multiplication scan from 
right-to-left on all but the first element 
of the left argument, and catenating a 1 
to that result: 


WHO1,x\O14A 

W+o1,x\O140 7 24 60 60 1000 

W+o1,x\O7 24 60 60 1000 

W+o1,x\1000 60 60 24 7 

W+o1,1000 60000 3600000 86400000 604800000 

W+o1 1000 60000 3600000 86400000 604800000 

W+604800000 86400000 3600000 60000 1000 1 
week day hour min. sec. ms 
(milliseconds in) 


Hence we multiply the number of milli- 
seconds in a week by the number of weeks, 
add it to the number of milliseconds in a 
day multiplied by the number of days, and 
so on, to produce the result: 


+/WxB 
788645006 


If either argument is scalar, it is 
extended to the length of the other 
argument, and the same weighting vector 
calculation is used. Thus the following 
expressions are equivalent: 


12 12 12 1 2 


314 
1212 12 1222 
314 
121222 
3414 
60 1 10 0 120 
36120 
60 60 60 1 10 0 120 
36120 
0 60 60 1 10 0 120 
36120 
“743 60 60 1 10 0 120 
36120 


As illustrated, the first element of 
the left argument does not affect the 
result, but serves only as a placeholder. 
If either argument is not a scalar or one- 
element vector, the lengths of the argu- 
ments must match. 


60 60110 0 120 
LENGTH ERROR 
60 60 1 10 0 120 
A 


This rule is identical to that used by 
inner product (for example, +.x), although 
in inner product the first element of the 
left argument does affect the result. 


SOME USEFUL APPLICATIONS 
Right justify a character vector JV: 
(1-(V=" ')11)06V 


Actually, this expression will right 
justify an array of any rank. It uses the 
“reverse multiplication scan" weighting 
vector calculation as an and~scan (A\), 
and the +/Wx1 to calculate (1+) the number 
of contiguous ending spaces. 


"Why Not to Loop" beginning on page 
37 contains the function FASTFLOW to com- 
pute a cash flow series for a starting 
balance (SB) and WN subsequent deposits 
(DP), all compounded at varying interest 
rates (IR). 


V CF+*RATE FASTFLOW DEP 
[1] CF+1,x\RATE+1 © CF+CFx+\DEP*tCF 
Vv 
SB+ 100 
DP+ 0 50 100 216 0 123.26 
IR+ 0.2 0.25 0.08 0 0.05 0.06 
IR FASTFLOW SB,DP 
100 120 200 316 100 105 234.56 


If you don't need the intermediate values 
and only wish to know the ending balance, 
the following expression will suffice: 


(0, 2R+1)1S5B,DP 
234,56 


Given an array YM containing months 
packed as YYMM or YYYYMM, add A months and 
return the dates in the same format: 


V R+A MONTHADD YM 
C1] R+1+100110000 12TA+12110000 100TYM-1 
Vv 
7 MONTHADD 197805 7806 197912 
197812 7901 198007 
“5 “11 ~16 MONTHADD 197804 
197711 197705 197612 
(16) MONTHADD 197806+16 
197808 197810 197812 197902 197904 197906 


This is a lovely illustration of the 
relationship between decode (1) and encode 
(tT). It uses both to convert back and 
forth between decimal and base-12 repre- 
sentation. 
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